Use of Multiple Features for Extracting Topics from News Clusters

نویسندگان

  • Aleksey Alekseev
  • Natalia V. Loukachevitch
چکیده

In this paper we consider a method for extraction of sets of semantically similar language expressions representing different participants of the text story – thematic nodes. The method is based on the structural organization of news clusters and exploits comparison of various contexts of words. The word contexts are used as a basis for multiword expression extraction and thematic node construction. We evaluate our method on the multi-document summarization task.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Columbia Newsblaster: Multilingual News Summarization on the Web

We propose to show the new multilingual version of the Columbia Newsblaster news summarization system. The system addresses the problem of user access to browsing news in multiple languages from multiple sites on the internet. The system automatically collects, organizes, and summarizes news in multiple source languages, allowing the user to browse news topics with English summaries, and compar...

متن کامل

Columbia's Newsblaster: New Features and Future Directions

Columbia’s Newsblaster tracking and summarization system is a robust system that clusters news into events, categorizes events into broad topics and summarizes multiple articles on each event. Here we outline our most current work on tracking events over days, producing summaries that update a user on new information about an event, outlining the perspectives of news coming from different count...

متن کامل

مقایسه روش‌های مختلف یادگیری ماشین در خلاصه‌سازی استخراجی گفتار به گفتار فارسی بدون استفاده از رونوشت

In this paper, extractive speech summarization using different machine learning algorithms was investigated. The task of Speech summarization deals with extracting important and salient segments from speech in order to access, search, extract and browse speech files easier and in a less costly manner. In this paper, a new method for speech summarization without using automatic speech recognitio...

متن کامل

A Cluster-based Approach to Broadcast News

We present an approach to detection and tracking of topics in multilingual broadcast news based upon a dynamic clustering scheme. Our approach derives from a system used to filter Web searches from multiple sources, with extensions for pipelining document clusters, part-of-speech tagging and extraction of named entities for use in an extended similarity measure.

متن کامل

مطالعۀ الگوهای جمعیت‌شناختی و رفتاری خوانندگان برای اشاعۀ گزینشی اخبار

Purpose: The current research focuses on selective dissemination of news and aims at finding patterns for recognition of readers’ favorite news through web mining technique. Method: Data for this research was collected from the Yahoo News Website. The source of news was Associated Press. 840 news dated between 2011/3/1 and 2011/5/10 was analyzed through subject clustering technique. Findings:...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012